Stochastic optimization of a cold atom experiment using a genetic algorithm 
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We employ an evolutionary algorithm to automatically optimize different stages of a cold atom 
experiment without human intervention. This approach closes the loop between computer based 
experimental control systems and automatic real time analysis and can be applied to a wide range 
of experimental situations. The genetic algorithm quickly and reliably converges to the most per- 
forming parameter set independent of the starting population. Especially in many-dimensional or 
connected parameter spaces the automatic optimization outperforms a manual search. 



In experimental research, scientists are often con- 
fronted with the task of finding ideal parameters to per- 
form a measurement. Although in modern setups, ex- 
perimental parameters are handled by computer-based 
control systems and also the evaluation of the experi- 
mental output is often performed by real-time computer 
systems, optimization is still essentially carried out by 
hand. 

In this article we present the application of a genetic 
algorithm to automatically optimize control parameters 
in an atomic physics experiment. Such algorithms have 
been employed to tackle optimization problems in differ- 
ent fields, ranging from mathematics and algorithmics [1] 
over economics [2j and aerospace engineering [3| to game 
theory 0]. Our algorithm generates a set of random 
starting parameters, automatically performs the experi- 
ment and recovers information about the performance of 
the parameter set from real-time analysis of the experi- 
mental result. This information is used to generate a next 
generation of parameters with better performance, until 
an optimum is reached (see figure[l}. This approach can 
be implemented in any experimental setup that requires 
optimization, provided that it is computer controlled and 
features real-time analysis. 

The experimental setup used in these investigations 
is a single-chamber atom chip setup for the creation 
and study of Bose- Einstein condensates (BEC) pjj]. Prc- 
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FIG. 1: The feedback loop between control system, experi- 
ment and real-time analysis is closed by the genetic algorithm. 
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cooled 87 Rb atoms are loaded into a magnetic trap cre- 
ated by current carrying macroscopic copper structures. 
Evaporative cooling is applied, creating a sample of 10 6 
atoms at microkelvin temperatures. These atoms are 
loaded onto an atom chip, where microscopic traps are 
generated by lithographically patterned wires [f| 0] , and 
cooled to quantum degeneracy in a second phase of evap- 
orative cooling. The wire structures on the atom chip 
allows a large variety of experiments like matter wave in- 
terferometry Q or the study of low-dimensional quantum 
gases @. After the experiment is performed, the atomic 
density distribution is recorded using standard absorp- 
tion or fluorescence imaging [lOj , either in the atom chip 
trap or in free time-of-flight expansion. As the atomic 
sample is destroyed in the imaging process, the above 
procedure has to be repeated many times when param- 
eters are changed and to accumulate statistics. The du- 
ration of an experiment is 35 s, the experimental cycle 
is repeated continuously and data taking can cover sev- 
eral days. It shall be pointed out that individual oper- 
ations within the experimental sequence take place on a 
timescale on the order of tens of microseconds, necessi- 
tating a relative temporal resolution on the 10 -6 level. 
A commercial real time control system (ADwin Pro I) in 
connection with a custom interface software coordinates 
the 60 involved individual devices. They are governed 
by a total parameter space of 2300 values during a typi- 
cal experimental sequence, a subspace of which is made 
accessible for each individual optimization task. 

Each experimental cycle produces an image of the 
atom cloud which yields a projected 2D measurement 
of the particle density distribution. Important physical 
quantities like the total number of atoms AT, the peak 
density uq, the position of the cloud or the phase space 
density can be derived already from a single picture. In 
terms of optimization theory, the above quantities can be 
interpreted as objective functions f{ji), where j% denotes 
a single parameter set out of the overall parameter space. 
Optimizing the experimental setup corresponds to find- 
ing a parameter set ji that maximizes (minimizes) a given 
objective function. Probing this function hence means 
performing the experimental sequence with a given pa- 
rameter set and determining the desired measurement 
value. Since in our experiments usually only few or even 
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FIG. 2: Block diagram of the optimization process. 



a single output quantity is considered, the problem is a 
prime example of a single-objective optimization prob- 
lem. Such tasks can be tackled effectively by stochastic 
algorithms 

The simplest approach would be to evaluate the ob- 
jective function on the entire parameter space. However, 
the number of necessary experimental runs grows rapidly 
with the dimensions of the parameter space. With 10 
evaluations per parameter, a scan over 1, 2, 3, 4 param- 
eters (i.e. 10, 100, 1000, 10000 parameter sets) takes 
about 6 min, 1 h, 10 h or 4 days respectively. 

An alternative approach are deterministic algorithms 
such as hill climbing [l 21 ] . where knowledge on the gra- 
dient of the objective function is used to generate im- 
proved parameter sets. They perform well for problems 
that behave approximately linear in the objective func- 
tion and guarantee convergence for unimodal problems 
represented by objective functions featuring only one ex- 
ternum. For multimodal problems, however, they are 
prone to converging against local extrema. 

Stochastic algorithms trade guaranteed convergence 
for shorter runtime in applications with a high number 
of (interacting) parameters (I3 |. They introduce a ran- 
dom element into the selection of points in parameter 
space, there is no formal way to prove whether a found 
solution is really the global optimum. Within stochastic 
algorithms, genetic algorithms belong to the class of par- 
allel optimization algorithms which do not rest on the 
evaluation of individual states in parameter space but 
rather evaluate the performance of an ensemble of states 
(generation). Compared with local algorithms, this ap- 
proach is less dependent on the specific initial conditions 
for each optimization run. Out of the different paral- 
lel methods, genetic algorithms are well explored, with 
numerous known implementational strategies (13 , [l4j . 

Figure 2 illustrates the working concept of our genetic 



algorithm. In a first step, a starting population is dis- 
tributed randomly in a parameter sub-space spanned by 
2-5 variables associated with a certain stage of the exper- 
iment. Sometimes population individuals near a known 
optimum or on the edges of parameter space are added 
to speed up convergence or test the performance of the 
algorithm. The size of the initial population has to be 
chosen carefully: Large populations are time consuming 
as they increase the number of evaluations per genera- 
tions as well as the number of generations. Too small 
starting populations have shown to lead to premature 
convergence as the diversity of the individuals decreases 
quickly. An intermediate initial population size of 10-15 
individuals has shown to perform well in various opti- 
mization tasks. 

To evaluate the performance of a population, the ex- 
perimental sequence is executed with the parameter set- 
tings represented by the individuals. The individuals are 
then sorted according to their performance with respect 
to the selected objective function, e.g. particle number 
or temperature. 

In analogy to evolution terminology, the fitness of a 
population individual F(jj) is a measure for the proba- 
bility for this state to be selected for reproduction. We 
have chosen linear ranking based fitness assignment [l5| , 
which considers the relative performance of individuals 
but is independent of the achieved absolute value of the 
objective function. It distributes the individual fitnesses 
between and 1: F(ji) — — (1 — ^Ey) where n is the size 
of the population. 

The selection process determines the parent individu- 
als that will 'breed' the next generation. From several 
possible selection schemes we have chosen stochastic uni- 
versal sampling [l3[ where parent individuals are cho- 
sen according to their fitness but which still introduces 
a stochastic process so that less performing individuals 
can contribute to the next generation and population di- 
versity is kept up. The total number of parent states 
is adjusted to 2/3 of the total population with 2 chil- 
dren per pair of parent states. Since the population size 
is chosen to be fixed throughout the optimization pro- 
cess, the free slots are filled up by the best states from 
the previous generation. This makes our algorithm 'eli- 
tist', assuring that the states showing best performance 
are not lost during the optimization process, speeding up 
convergence. 

The next generation individuals are created by inter- 
mediary recombination (l6t Il7j . The parameters of the 
two parent states are taken as limiting edges of a hy- 
percube in parameter space, the two child states are dis- 
tributed randomly within this volume. To avoid too rapid 
decrease of the children's parameter space, its volume is 
stretched by a factor 1.5 [T^, ■ 

Mutation is introduced to further impede convergence 
against local extrema. Its influence on the performance 
of genetic algorithms has been investigated in great detail 
and several well performing schemes are known (TtI [T3 | . 
For each state within one generation, there is a certain 
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FIG. 3: Positions of the individuals in the three-dimensional parameter space through seven generations. Convergence is clearly 
observed. On the bottom right, the achieved mean atom number for each generation is depicted. The error bars correspond to 
the standard deviation of the individual's results within each generation. 



probability for random alteration, in our case 1 %, with 
the additional constraint that only one of the state's 
variables is affected. The mutated values are obtained 
by a mechanism shown in [p|, that constructs j™ ut = 
ji + Si ■ rDi ■ 2~ uk , where jP 11 ' and ji denote the mutated 
and the source state respectively, Si randomly choses the 
sign of the mutation step and r defines the mutation 
range as a fraction of the accessible parameter space Di. 
The last factor designates the actual distribution, char- 
acterized by the random number u uniformly distributed 
in [0, 1] and the mutation precision k (Miihlcnbein's mu- 
tation). With r = 0.1 and k = 10 we have adapted 
established values [l6[ . After the mutation step the next 
child population is ready to be tested on the experiment. 

For simplicity, the optimization cycle is repeated for a 
preset number of 10 times or is stopped manually when 
the diversity in the population has dropped to the point 
that all states are equal. More evolved convergence cri- 
teria like the distance of individuals in parameter space 
or convergence of the objective function can be imple- 
mented easily. 

The described genetic algorithm has been used to opti- 
mize various stages of the experimental sequence Q like 
the optical pumping into magnetically trappable states 
or the evaporative cooling ramps. Figure 3 shows an 
example of an optimization of critical experimental pa- 
rameters during the phase of optical molasses, which ef- 
fectuates an additional cooling of the atoms coming from 
the magneto-optical trap before they are loaded into a 
magnetic wire trap. The three-dimensional parameter 
space is spanned by two magnetic fields (Up/Down and 
Bias), and the detuning of the cooling laser frequency 
with respect to the atomic transition. The two magnetic 
fields have to be optimized to compensate the earth's 
magnetic field and possible unknown stray fields, the 
laser frequency has to be optimized to achieve an effi- 
cient cooling (high detuning) while keeping the atoms 
confined (low detuning). As shown in figure 3 the ge- 
netic algorithm converges to optimum parameters within 



7 generations with a starting population of 10 individu- 
als. The overall runtime was 30 minutes, as compared to 
386 h it would need to sample the entire parameter vol- 
ume with the same resolution. Note that the optimum 
was found although the first two generations contained 
not a single well performing individual. The algorithm 
rapidly and reliably finds optimum values for various op- 
timization tasks and reproduces or outperforms manu- 
ally optimized parameters sets. For too small starting 
populations, it has shown to be prone to premature con- 
vergence leading to sub-optimal final parameters. Nu- 
merous characteristics of the algorithm such as the size 
of the starting population, the amount of reproducing in- 
dividuals per generation or the mutation rate and range 
can be adapted, weighing optimality against runtime. 

In future work we will extend the approach to objec- 
tive functions that cannot be evaluated in a single exper- 
imental run (image) but probe temporal dynamics of the 
system or the scattering of measured parameters around 
the mean value, requiring several measurements to evalu- 
ate a single parameter set. This will allow applications of 
the genetic algorithms beyond mere optimization tasks, 
like the reduction of excitations or the enhancement of 
squeezing or entanglement in dynamic matter wave in- 
terferometry [2fJ . Ultimately, we aim to develop " exper- 
imental optimal control" , in analogy to optimal control 
theory, where the result of the optimization directly pro- 
vides scientific insight, not accessible by other means. 

To conclude, we have implemented a robust and effi- 
cient stochastic genetic algorithm to automatically per- 
form time-consuming optimization tasks in a complex 
cold atom experiment. In the given example, the algo- 
rithm converged to optimal settings within 30 min, as 
compared to several hours typically required for manual 
search. Our approach closes the loop between computer- 
based experimental control systems and real-time data 
analysis and can be implemented in many situations in 
experimental research. 
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